Hybrid Model for Preprocessing and Clustering of Web Server Log

Authors

  • T. Subha Mastan Rao
  • Sujata Pradhan
Abstract

As the World Wide Web (WWW) grows in both complexity and traffic volume, it has become very important to analyze web traffic and the way users use a web site. Web usage mining is a major research area in web mining focused on learning about web users and their interactions with web sites. Information such as server logs, browser cookies, and other related data can be used to discover users' access patterns automatically and quickly from web log data, such as the most frequently accessed parts, the least recently accessed page groups, and user clusters [1][2]. In this paper we analyze the system by implementing the major recommendation approaches and by preprocessing the log files stored on the web server and applying a clustering algorithm. On that basis, we extract valuable information about the behavior of interested users and web designers, providing a better foundation for an organization's decision making and better service to customers and web users.

Keywords: Clustering approach, Knowledge set, Preprocessing, Server log file, Web usage mining
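The pipeline the abstract describes (preprocess the server log, then cluster users) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' method: the abstract does not name a log format, a cleaning rule, or a clustering algorithm, so the sketch assumes Common Log Format entries, uses the client host as a stand-in for the user, drops requests for an assumed list of static-file extensions, and uses a small deterministic k-means. All function names and the sample log lines are hypothetical.

```python
import re
from collections import defaultdict

# Assumed input: Common Log Format
#   host ident user [time] "METHOD path PROTOCOL" status size
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) \S+'
)

# Assumed list of static-resource extensions discarded during cleaning.
STATIC_EXTENSIONS = (".gif", ".jpg", ".jpeg", ".png", ".css", ".js", ".ico")


def preprocess(lines):
    """Parse raw log lines; keep successful GET requests for pages only."""
    visits = defaultdict(lambda: defaultdict(int))  # host -> path -> hits
    for line in lines:
        match = LOG_PATTERN.match(line)
        if match is None:
            continue  # skip malformed entries
        if match["method"] != "GET" or not match["status"].startswith("2"):
            continue  # skip non-GET and unsuccessful requests
        path = match["path"].split("?", 1)[0]  # drop any query string
        if path.lower().endswith(STATIC_EXTENSIONS):
            continue  # skip images, stylesheets, scripts
        visits[match["host"]][path] += 1
    return visits


def to_vectors(visits):
    """Turn per-host page counts into fixed-length numeric vectors."""
    pages = sorted({page for counts in visits.values() for page in counts})
    hosts = sorted(visits)
    vectors = [[float(visits[h].get(p, 0)) for p in pages] for h in hosts]
    return hosts, pages, vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, iterations=10):
    """Small k-means with deterministic farthest-point seeding."""
    centroids = [list(vectors[0])]
    while len(centroids) < k:
        farthest = max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        centroids.append(list(farthest))
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assignment step: attach each vector to its nearest centroid.
        for i, v in enumerate(vectors):
            distances = [sq_dist(v, c) for c in centroids]
            labels[i] = distances.index(min(distances))
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Hypothetical log lines, invented for illustration only.
SAMPLE_LOG = [
    '10.0.0.1 - - [10/Oct/2013:13:55:36 +0000] "GET /products.html HTTP/1.1" 200 2326',
    '10.0.0.1 - - [10/Oct/2013:13:55:37 +0000] "GET /logo.png HTTP/1.1" 200 512',
    '10.0.0.2 - - [10/Oct/2013:13:56:01 +0000] "GET /products.html HTTP/1.1" 200 2326',
    '10.0.0.3 - - [10/Oct/2013:13:57:12 +0000] "GET /support.html HTTP/1.1" 200 1804',
    '10.0.0.3 - - [10/Oct/2013:13:57:40 +0000] "GET /missing.html HTTP/1.1" 404 209',
]

hosts, pages, vectors = to_vectors(preprocess(SAMPLE_LOG))
labels = kmeans(vectors, k=2)
```

Here `labels[i]` is the cluster assigned to `hosts[i]`. In practice, user identification would likely need more than the client host (for example cookies or session timeouts, which the abstract mentions as available information), and the choice of clustering algorithm and of `k` would depend on the data.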


Similar resources

A density based clustering approach to distinguish between web robot and human requests to a web server

Today, the world's dependence on the Internet and the emergence of Web 2.0 applications are significantly increasing the need for web robots that crawl sites to support services and technologies. Regardless of their advantages, robots may consume bandwidth and reduce the performance of web servers. Despite a variety of research, there is no accurate method for classifying huge data ...


Data Preprocessing: A Milestone of Web Usage Mining

The Internet today is full of structured and unstructured information, and this information directly or indirectly influences society and people, because the Internet is now part of our daily lives. But using this abundant and ambiguous information efficiently for useful decision making is still a big challenge. During our web surfing, whether it is online shopping, blogging, or using tweets...


An Efficient Algorithm for Data Cleaning of Log File using File Extensions

The World Wide Web is a monolithic repository of web pages that provides Internet users with heaps of information. With the growth in the number and complexity of websites, the web has become massively large. Web Usage Mining is a division of web mining that applies mining techniques to web server logs in order to extract the behavior of users. A Web Usage Mining process com...


Perplexities in Discovering Navigation Patterns from Server Log

Web navigation patterns discovered from usage data can be used to build a prediction model that recommends interesting web pages to users. A user session may have one or more transactions. Identifying transactions or user behaviors from session data is difficult because web pages cannot be classified strictly as navigation or content pages. In order to identify transactions from log data, ...


Clustering of Web Usage Data Using Fuzzy Tolerance Rough Set Similarity and Table Filling Algorithm

Web Usage Mining is the application of data mining techniques to learn usage patterns from Web server log files in order to understand and better serve the requirements of web-based applications. Web Usage Mining includes three main steps, namely data preprocessing, pattern discovery, and analysis of the discovered patterns. One of the most important tasks in Web usage mining is to find ...




Publication date: 2013